N=1:

rank | frequency | n-gram |
---|---|---|
1 | 781 | -h |
2 | 727 | -a |
3 | 696 | -n |
4 | 446 | -t |
5 | 423 | -g |

N=2:

rank | frequency | n-gram |
---|---|---|
1 | 406 | -ng |
2 | 335 | -an |
3 | 264 | -ah |
4 | 178 | -at |
5 | 162 | -ih |

N=3:

rank | frequency | n-gram |
---|---|---|
1 | 132 | -ang |
2 | 129 | -jih |
3 | 66 | -ong |
4 | 53 | -ara |
5 | 51 | -lah |

N=4:

rank | frequency | n-gram |
---|---|---|
1 | 37 | -uëng |
2 | 28 | -euët |
3 | 22 | -nyan |
4 | 21 | -rang |
5 | 21 | -ajih |

N=5:

rank | frequency | n-gram |
---|---|---|
1 | 26 | -euëng |
2 | 9 | -intah |
3 | 9 | -umpok |
4 | 9 | -nyang |
5 | 9 | -unong |

The tables show the most frequent letter N-grams at the end of words for N=1…5. The procedure parallels that of 2.2.5 Most frequent word beginnings, except that the aim here is suffix detection rather than prefix detection.
For N=3:

    SELECT @pos := (@pos + 1), xx.*
    FROM (SELECT @pos := 0) r,
         (SELECT COUNT(*) AS cnt, CONCAT('-', RIGHT(word, 3))
          FROM words
          WHERE w_id > 100
          GROUP BY RIGHT(word, 3)
          ORDER BY cnt DESC) xx
    LIMIT 5;
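
For readers without database access, the same ranking can be approximated in a few lines of Python. This is a minimal sketch, not the script used to produce the tables: the function name `top_endings`, the toy word list, and the alphabetical tie-break are all assumptions made for illustration (the query itself leaves the order of ties unspecified). As in the query, each distinct word is counted once.

```python
from collections import Counter


def top_endings(words, n, k=5):
    """Return the k most frequent n-letter word endings as (rank, count, ngram) rows."""
    # w[-n:] behaves like RIGHT(word, n): a word shorter than n contributes itself.
    counts = Counter(w[-n:] for w in words)
    # Sort by descending count; ties are broken alphabetically (an assumption,
    # chosen only to make the output deterministic).
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [
        (rank, count, "-" + ending)
        for rank, (ending, count) in enumerate(ranked[:k], start=1)
    ]


# Hypothetical toy word list, not taken from the corpus.
sample_words = ["bang", "sang", "rang", "jih", "bajih", "long"]

for n in (1, 3):
    for row in top_endings(sample_words, n):
        print(n, row)
```

Calling `top_endings(words, n)` for n = 1 to 5 on the full list of distinct words would yield one table per N, in the same rank, frequency, n-gram layout as above.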